Mapping Soil Properties of Africa at 250 m Resolution: Random Forests Significantly Improve Current Predictions
Abstract
80% of arable land in Africa has low soil fertility and suffers from physical soil problems. Additionally, significant amounts of nutrients are lost every year due to unsustainable soil management practices. This is partially the result of insufficient use of soil management knowledge. To help bridge the soil information gap in Africa, the Africa Soil Information Service (AfSIS) project was established in 2008. Over the period 2008-2014, the AfSIS project compiled two point data sets: the Africa Soil Profiles (legacy) database and the AfSIS Sentinel Site database. These data sets contain over 28,000 sampling locations and represent the most comprehensive soil sample data sets of the African continent to date. Using these point data sets in combination with a large number of covariates, we have generated a series of spatial predictions of soil properties relevant to agricultural management: organic carbon, pH, sand, silt and clay fractions, bulk density, cation-exchange capacity, total nitrogen, exchangeable acidity, Al content and exchangeable bases (Ca, K, Mg, Na). We specifically investigate differences between two predictive approaches: random forests and linear regression. Results of 5-fold cross-validation demonstrate that the random forests algorithm consistently outperforms the linear regression algorithm, with average decreases of 15-75% in Root Mean Squared Error (RMSE) across soil properties and depths. Fitting and running random forests models takes an order of magnitude more time, and modelling success is sensitive to artifacts in the input data, but as long as quality-controlled point data are provided, an increase in soil mapping accuracy can be expected. Results also indicate that globally predicted soil classes (USDA Soil Taxonomy, especially Alfisols and Mollisols) help improve continental-scale soil property mapping and are among the most important predictors. This indicates promising potential for transferring pedological knowledge from data-rich countries to countries with limited soil data.
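The evaluation protocol described in the abstract (k-fold cross-validation, then the percentage decrease in RMSE of one model relative to another) can be sketched in plain Python. This is only an illustration and not the authors' code: the data are synthetic, the helper names (`cross_validated_rmse`, `relative_rmse_decrease`, `fit_mean`, `fit_line`) are hypothetical, and the two toy models (a mean predictor and a one-covariate least-squares line) merely stand in for the linear regression and random forests models compared in the paper. Averaging the per-fold RMSE values is also an assumption; the abstract does not say how fold results were pooled.

```python
import math
import random


def rmse(observed, predicted):
    """Root Mean Squared Error between two equal-length sequences."""
    squared = [(o - p) ** 2 for o, p in zip(observed, predicted)]
    return math.sqrt(sum(squared) / len(squared))


def k_fold_indices(n, k=5, seed=0):
    """Shuffle the indices 0..n-1 and split them into k disjoint folds."""
    indices = list(range(n))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]


def cross_validated_rmse(fit, xs, ys, k=5, seed=0):
    """Mean held-out RMSE over k folds; fit(train_x, train_y) returns predict(x)."""
    fold_scores = []
    for test_idx in k_fold_indices(len(xs), k, seed):
        held_out = set(test_idx)
        train_x = [x for i, x in enumerate(xs) if i not in held_out]
        train_y = [y for i, y in enumerate(ys) if i not in held_out]
        predict = fit(train_x, train_y)
        fold_scores.append(
            rmse([ys[i] for i in test_idx], [predict(xs[i]) for i in test_idx])
        )
    return sum(fold_scores) / k


def relative_rmse_decrease(rmse_baseline, rmse_candidate):
    """Percentage decrease in RMSE of the candidate relative to the baseline."""
    return 100.0 * (rmse_baseline - rmse_candidate) / rmse_baseline


def fit_mean(train_x, train_y):
    """Toy baseline (stand-in only): always predict the training mean."""
    mean_y = sum(train_y) / len(train_y)
    return lambda x: mean_y


def fit_line(train_x, train_y):
    """Toy candidate (stand-in only): one-covariate ordinary least squares."""
    n = len(train_x)
    mean_x = sum(train_x) / n
    mean_y = sum(train_y) / n
    sxx = sum((x - mean_x) ** 2 for x in train_x)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(train_x, train_y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return lambda x: intercept + slope * x


# Synthetic data (assumption): one covariate, linear signal plus Gaussian noise.
noise = random.Random(1)
xs = [i / 10 for i in range(50)]
ys = [1.0 + 2.0 * x + noise.gauss(0.0, 0.5) for x in xs]

baseline_rmse = cross_validated_rmse(fit_mean, xs, ys, k=5)
candidate_rmse = cross_validated_rmse(fit_line, xs, ys, k=5)
decrease = relative_rmse_decrease(baseline_rmse, candidate_rmse)
print(f"baseline RMSE:  {baseline_rmse:.3f}")
print(f"candidate RMSE: {candidate_rmse:.3f}")
print(f"RMSE decrease:  {decrease:.1f}%")
```

With real soil data, the same harness would be run once per soil property and depth, passing a fitting function for each model being compared.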
Similar resources
Mapping Dieback Intensity Distribution in Zagros Oak Forests Using Geo-statistics and Artificial Neural Network
The first and most important issue in forest drought management is knowledge of the location and severity of forest decline. In this regard, we used geostatistics and artificial neural network methods to map the dieback intensity of oak forests in the Ilam province, Iran. We used a systematic random sampling with a 250 × 200 m grid to establish 100 plots, each covering 1200 m². The percentage ...
Random forests algorithm in podiform chromite prospectivity mapping in Dolatabad area, SE Iran
The Dolatabad area located in SE Iran is a well-endowed terrain owning several chromite mineralized zones. These chromite ore bodies are all hosted in a colored mélange complex zone comprising harzburgite, dunite, and pyroxenite. These deposits are irregular in shape, and are distributed as small lenses along colored mélange zones. The area has a great potential for discovering further chromite...
High Resolution Mapping of Soil Properties Using Remote Sensing Variables in South-Western Burkina Faso: A Comparison of Machine Learning and Multiple Linear Regression Models
Accurate and detailed spatial soil information is essential for environmental modelling, risk assessment and decision making. The use of Remote Sensing data as secondary sources of information in digital soil mapping has been found to be cost effective and less time consuming compared to traditional soil mapping approaches. But the potentials of Remote Sensing data in improving knowledge of loc...
NIR-Red Spectra-Based Disaggregation of SMAP Soil Moisture to 250 m Resolution Based on OzNet in Southeastern Australia
To meet the demand of regional hydrological and agricultural applications, a new method named near infrared-red (NIR-red) spectra-based disaggregation (NRSD) was proposed to perform a disaggregation of Soil Moisture Active Passive (SMAP) products from 36 km to 250 m resolution. The NRSD combined proposed normalized soil moisture index (NSMI) with SMAP data to obtain 250 m resolution soil moistu...
Segmentation and sequential classification of a synthetized image composed of spatial environmental data for the compilation of a soil type map
A unified, national soil type map with spatially consistent predictive capabilities was compiled applying traditional and newly tested Digital Soil Mapping classification methods: segmentation of a synthesized image consisting of predictor variables and multi-phase, sequential classification by Classification and Regression Trees, Random Forests and Artificial Neural Networks. Object based clas...